Why Search Engines are Used Increasingly to Offload Queries from Databases

Author

  • Bjørn Olstad
Abstract

The development of future search engine technology is no longer limited to free text. Rather, the aim is to build core indexing services that focus on extreme performance and scalability for retrieval and analysis across structured and unstructured data sources alike. In addition, binary query evaluation is being replaced with advanced frameworks that provide both fuzzy matching and ranking schemes to separate value from noise. As another trend, analytical applications are being enabled by the on-the-fly computation of contextual concept relationships across billions of documents or records. Based on these developments in search engine technology, a set of new information retrieval infrastructure patterns is appearing: 1. the mirroring of database content into a search engine in order to improve query capacity and user experience, 2. the use of search engine technology as the default access pattern to both structured and unstructured data in applications such as CRM, storage, and document management, and 3. a predicted paradigm shift in business intelligence. The presentation will review key trends from search engine development and relate these to concrete user scenarios.

About the Speaker

Bjørn Olstad is the CTO of FAST Search & Transfer and an adjunct professor at the Norwegian University of Science and Technology (NTNU). FAST has emerged as the leading provider of Enterprise Search Platforms (ESP). The FAST ESP platform has been embedded as the information access layer in applications such as Siebel, EMC Storage and Documentum. Companies like ReedElsevier, IBM, Dell, AOL, Factiva and Reuters use FAST ESP to power information retrieval and analytics solutions. Before joining FAST, Olstad was a professor at NTNU and headed development at GE Healthcare, Cardiac Ultrasound. He has published more than 70 research papers and has been granted more than 30 patents.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.

Proceedings of the 31st VLDB Conference, Trondheim, Norway, 2005

Similar resources

A New Model for Phrase Search Based on Minimum Weighted Displacement

Finding high-quality web pages is one of the most important tasks of search engines. The relevance between the documents found and the query searched depends on the user observation and increases the complexity of ranking algorithms. The other issue is that users often explore just the first 10 to 20 results while millions of pages related to a query may exist. So search engines have to use sui...

On Low Overlap among Search Results of Academic Search Engines

The number of published scholarly articles is growing exponentially. To tackle this information overload, researchers are increasingly depending on niche academic search engines. Recent works have shown that two major general web search engines, Google and Bing, have a high level of agreement in their top search results. In contrast, we show that various academic search engines have low degree of agr...

A Study of the Correspondence between Users' Search Queries and the Suggested Terms of Articles in Bibliographic Records of the Latin Databases EBSCO and IEEE

Purpose: This study aims to investigate the correspondence of users' queries with the alternative terms of Latin databases, namely IEEE and EBSCO. Databases display subjective content of their documents through natural or controlled language vocabularies in specified bibliographic fields, along with other bibliographic information, which are called papers' alternative terms. Methodology: We used content an...

The Impact of the Objective Complexity and Product of Work Task on Interactive Information Searching Behavior

Background and Aim: This study aimed to explore the impact of the objective complexity and product of work task on users' interactive information searching behavior. Method: The research population consisted of MSc students of Ferdowsi University of Mashhad enrolled in the 2012-13 academic year. In 3 stages of sampling (random stratified, quota, and voluntary sampling), 30 cases were selected. Each of ...

Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines

Purpose: Traditionally, there have been many metrics for evaluating search engines; nevertheless, various researchers have proposed new metrics in recent years. Awareness of these new metrics is essential for conducting research in the field of search engine evaluation. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating search engines. Methodology: This is ...

Publication date: 2005